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Abstract 

We develop a complete state-space solution to "Wz-optimal decentralized control of poset-causal 
systems with state-feedback. Our solution is based on the exploitation of a key separability property of 
the problem, that enables an efficient computation of the optimal controller by solving a small number 
of uncoupled standard Riccati equations. Our approach gives important insight into the structure of 
optimal controllers, such as controller degree bounds that depend on the structure of the poset. A novel 
element in our state-space characterization of the controller is a remarkable pair of transfer functions, 
that belong to the incidence algebra of the poset, are inverses of each other, and are intimately related to 
prediction of the state along the different paths on the poset. The results are illustrated by a numerical 
example. 

I. Introduction 

Finding computationally efficient algorithms to design decentralized controllers is a challeng- 
ing area of research (see e.g. [17], [4] and the references therein). Current research suggests 
that while the problem is hard in general, certain classes with special information structures 
are tractable via convex optimization techniques. In past work, the authors have argued that 
communication structures modeled by partially ordered sets (or posets) provide a rich class of 
decentralized control systems (which we call poset-causal systems) that are amenable to such an 
approach [17]. Posets have appeared in the control theory literature earlier in the context of team 
theory [10], and specific posets (chains) have been studied in the context of decentralized control 
[25]. Poset-causal systems are also related to the class of systems studied more classically in 
the context of hierarchical systems [11], [9], where abstract notions of hierarchical organization 
of large-scale systems were introduced and their merits were argued for. 
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While it is possible to design optimal decentralized controllers for a fairly large class of systems 
known as quadratically invariant systems in the frequency domain via the Youla parametrization 
[14], there are some important drawbacks with such an approach. Typically Youla domain 
techniques are not computationally efficient, and the degree of optimal controllers synthesized 
with such techniques is not always well-behaved. In addition to computational efficiency, issues 
related to numerical stability also arise. Typically, operations at the transfer function level are 
inherently less stable numerically. Moreover, such approaches typically do not provide insight 
into the structure of the optimal controller. These drawbacks emphasize the need for state-space 
techniques to synthesize optimal decentralized controllers. State-space techniques are usually 
computationally efficient, numerically stable, and provide degree bounds for optimal controllers. 
In our case we will also show that the solution provides important insight into the structure of 
the controller. 

In this paper we consider the problem of designing "Hi optimal decentralized controllers for 
poset-causal systems. The control objective is the design of optimal feedback laws that have 
access to local state information. We emphasize here that different subsystems do not have 
access to the global state, but only the local states of the systems in a sense that will be made 
precise in the next section. The main contributions in the paper are as follows: 

. We show a certain crucial separabiUty property of the problem under consideration. This 
result is outlined in Theorem 2. This makes it possible to decompose the decentralized 
control problem over posets into a collection of standard centralized control problems. 
. We give an explicit state-space solution procedure in Theorem 3. To construct the solution, 
one needs to solve standard Riccati equations (corresponding to the different sub-problems). 
Using the solutions of these Riccati equations, one constructs certain block matrices and 
provides a state-space realization of the controller. 
. We provide bounds on the degree of the optimal controller in terms of a parameter cr^ that 

depends only on the order-theoretic structure of the poset (Corollary 2). 
. In Theorem 4 we briefly describe the structural form of the optimal controller. We introduce 
a novel pair of transfer functions (O, F) which are inverses of each other, and which capture 
the prediction structure in the optimal controller. We call O the propagation filter, it plays 
a role in propagating local signals (such as states) upstream based on local information. We 
call r the differential filter, it corresponds to computation of differential improvement in the 
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prediction of the state at different subsystems. The discussion related to structural aspects 
is brief and informal in this paper and have been formalized in the paper [20]. 
. We state a new and intuitive decomposition of the structure of the optimal controller into 
certain local control laws. 

A. Related Work 

It is well-known that in general decentralized control is a hard problem, and significant research 
efi'orts have been directed towards its many different aspects; see for instance the classical survey 
[15] for many of the earlier results. More recently, Blondel and Tsitsiklis [3] have shown that 
in certain instances, decentralized control problems are computationally intractable, in particular 
they show that the problem of finding bounded-norm, block-diagonal stabiUzing controllers in 
the presence of output-feedback is NP-hard. On the other hand, Voulgaris [25], [26] presented 
several cases where decentralized control problems are amenable to a convex reparametrization 
and therefore computationally tractable. Lall and Rotkowitz generalize these ideas in terms of 
a property called quadratic invariance [14], we discuss connections to their work later. In past 
work [17], we have shown that posets provide a unifying umbrella to describe these tractable 
examples under an appealing theoretical framework. 

Partially ordered sets (posets) are very well studied objects in combinatorics. The associated 
notions of incidence algebras and Galois connections were first studied by Rota [13] in a 
combinatorics setting. Since then, order-theoretic concepts have been used in engineering and 
computer science; we mention a few specific works below. In control theory, ideas from order 
theory have been used in diff"erent ways. Ho and Chu used posets to study team theory problems 
[10]. They were interested in sequential decision making problems where agents must make 
decisions at diff"erent time steps. They study computational and structural properties of optimal 
decision policies when the problems have poset structure. MuUans and Elliot [12] use posets 
to model the notions of time and causality, and study evolution of systems on locally finite 
posets. Wyman [27] has studied time-varying linear-systems evolving on locally finite posets in 
an algebraic framework, including aspects related to realization theory and duality. In computer 
science, Cousot and Cousot used these ideas to develop tools for formal verification of computer 
programs in their seminal paper [7]. Del Vecchio and Murray [24] have used ideas from lattice 
and order theory to construct estimators for continuous states in hybrid systems. 
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More recently, the authors of this paper have initiated a systematic study of decentraUzed 
control problems from the point of view of partial order theory. In [17], we introduce the partial 
order framework and show how several well-known classes of problems such as nested systems 
[25] fit into the partial order framework. In [18], we extend this poset framework to spatio- 
temporal systems and generalize certain results related to the so called "funnel causal systems" 
of Bamieh et. al [2]. In [19], we show that a class of time-delayed systems known to be amenable 
to convex reparametrization [14] also has an underlying poset structure. In that paper, we also 
study the close connections between posets and another class of decentralized control problems 
known as quadratically invariant problems. While this poset framework provides a lens to view 
all these examples in a common intuitive framework, a systematic study of state-space approaches 
has been lacking. 

In an interesting paper by Swigart and Lall [22], the authors consider a state-space approach 
to the "Hi optimal controller synthesis problem over a particular poset with two nodes. Their 
approach is restricted to the finite time horizon setting (although in a subsequent paper [23], they 
extend this to the infinite time horizon setting), and uses a particular decomposition of certain 
optimality conditions. In this setting, they synthesize optimal controllers and provide insight into 
the structure of the optimal controller. These results are also summarized in the thesis [21]. By 
using our new separability condition (which is related to their decomposition property, but which 
we believe to be more fundamental) we significantly generaUze those results in this paper. We 
provide a solution for all posets and for the infinite time horizon. In recent work [14], Rotkowitz 
and Lall proposed a state-space technique to solve optimal control problems for quadratically 
invariant systems (which could be used for poset-causal systems). An important drawback of 
their reformulation is that one would need to solve larger Riccati equations. Our approach for 
poset-causal systems is more efficient computationally. Moreover, our approach also provides 
insight into the structure of the optimal controllers. 

The rest of this paper is organized as follows. In Section n we introduce the necessary 
preliminaries regarding posets, the control theoretic framework and notation. In Section IE we 
describe our solution strategy. In Section IV we present the main results. We devote Section V 
to a discussion of the main results, and their illustration via examples. 



4 



II. Preliminaries 

In this section we introduce some concepts from order theory. Most of these concepts are well 
studied and fairly standard, we refer the reader to [1],[8] for details. 

A. Posets 

Definition 1: A partially ordered set (or posef) V = (P, <) consists of a set P along with a 
binary relation < which has the following properties: 

1) a < a (reflexivity), 

2) a < b and b < a implies a = b (antisymmetry), 

3) a < b and b < c implies a < c (transitivity). 

We will sometimes use the notation a < b to denote the strict order relation a <b but a b. 

In this paper we will deal with finite posets (i.e. \P\ is finite). It is possible to represent a poset 
graphically via a Hasse diagram by representing the transitive reduction of the poset as a graph 
(i.e. by drawing only the minimal order relations graphically, a downward arrow representing 
the relation <, with the remaining order relations being implied by transitivity). 

Example 1: An example of a poset with three elements (i.e., P = {1, 2, 3}) with order relations 
1 < 2 and 1 < 3 is shown in Figure 1(b). 




(a) (b) (c) (d) 

Fig. 1. Hasse diagrams of some posets. 

Let P = (P,<) be a poset and let p e P. We define Ip = {q e P \ p < q} (we call this the 
downstream set). * Let Hp = {q e P \ p < q,q p}. Similarly, let ^p = {q e P \ q < p} (called 

*We have reversed conventions with respect to some of our conference papers, wherein the Hasse diagrams are drawn with 
upward arrows and the set \p corresponds to the set {q e P \ p < q}. The present convention has been adopted to make the 
presentation more intuitive. For example the downstream set at p corresponds to elements drawn lower in the Hasse diagram. 
It also corresponds to the elements that are "in the future" with respect to p, in keeping with the intuition that information in a 
river propagates "downstream". 
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the upstream set), and ftp = il ^ P \ 1 ^ P,<1 p}- We define ifp = ^ ^ I ? ^ ^ p} 
(called the off-stream set); this is the set of uncomparable elements that have no order relation 
with respect to p. Define an interval [z, j] = {peP\i<p< j}. A minimal element of the poset 
is an element p e P such that if q < p for some q e P then q = p. (A maximal element is 
defined analogously). 

In the poset shown in Figure 1(d), il = {1,2,3,4}, whereas HI = {2,3,4}. Similarly TTl = 0, 
T4 = {1,2,3,4}, and TT4 = {1,2,3}. The set U2 = {3}. 

Definition 2: Let P = (P, <) be a poset. Let Q be a ring. The set of all functions f : PxP 

Q with the property that f(x, y) = if j x is called the incidence algebra of V over Q. It is 
denoted by J(:P). * 

When the poset P is finite, the elements in the incidence algebra may be thought of as matrices 
with a specific sparsity pattern given by the order relations of the poset in the following way. 
One indexes the rows and columns of the matrices by the elements of P. Then the (/, 7) entry 
for /, j e P of the matrix corresponds to /(/, j). By Definition 2, if j ^ / then the (/, j) entry of 
the matrix must be zero. An example of an element of I(P) for the poset from Example 1 (Fig. 
1(b)) is: 



1 
1 1 
1 1 



Given two functions /, g e ICP), their sum f+g and scalar multiplication cf are defined as usual. 
The product h = f • g is defined by h(x,y) = 'Zzepf(-^^z)g(z,y). Note that the above definition of 
function multiplication is made so that it is consistent with standard matrix multiplication. 

Lemma 1: Let ^ be a poset. Under the usual definition of addition and multiplication as 
defined in (1) the incidence algebra is an associative algebra (i.e. it is closed under addition, 
scalar multiplication and function multiplication). 

Proof: The proof is standard, see for example [17]. ■ 

•Standard definitions of the incidence algebra use an opposite convention, namely f(x,y) = if x ^ y. Thus, the matrix 
representation of the incidence algebra is typically a transposal of the matrix representations that appear here. For example, 
while the incidence algebra of a chain is the set of lower-triangular matrices in this paper, in standard treatments it would appear 
as upper-triangular matrices. We reverse the convention so that in a control theoretic setting one may interpret such matrices as 
representing poset-causal maps. 
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Given / < j, let [/ j] denote the set of all chains from / to j of the form {/, /i}, . . . , {ik, j} such 
that i < h < • • • < ik < j- For example, in the poset in Fig. 1(c), [1^3] = {{{1,2} , {2, 3}} ,{1,3}}. 
A standard corollary of Lemma 1 is the following. 

Corollary 1: Suppose A 6 TCP). Then A is invertible if and only if An is invertible for all 
/ G P. Furthermore A~^ G and the inverse is given by: 



llp.MHi] nmep.,i-AikA-^l) if i ^ j 
A^ if i = j. 



B. Control Theoretic Preliminaries 

We consider the following state-space system in continuous time: 



x{t) = Axit) + Fw{t) + Bu{t) 

(1) 

z{t) = Cx(t) + Du{t). 

In this paper we present the continuous time case only, however, we wish to emphasize that 
analogous results hold in discrete time in a straightforward manner. In this paper we consider 
what we will call poset-causal systems. We think of the system matrices (A,B,C,D,F) to be 
partitioned into blocks in the following natural way. Let P = (P, <) be a poset with P = {l,...,p}. 
We think of this system as being divided into p sub-systems, with sub-system / having some states 
Xi(t) G R"', and control inputs m,(0 g R*"' for / g (1, . . . ,/?}. The external output is z(t) G R^ The 
signal w(t) is a disturbance signal. (To use certain standard state-space factorization results, we 
assume that C^D = and D^D > 0, these assumptions can be relaxed in a straightforward way). 
The states and inputs are partitioned in the natural way such that the sub-systems correspond to 
elements of the poset P with x(t) = ^xi(t) \x2(t) \. . . \xp(t) j , and u(t) = ^ui(t) \u2(t) |. . . \up(t) j . 
This naturally partitions the matrices A, B, C, D, F into appropriate blocks so that A = |^A/^ j . 
^ = [^'j\ij.p' ^ = [^j]j.p (P^itio^ed into columns), D = [Dj\,^^, F = [f,,]. (We will 
throughout deal with matrices at this block-matrix level, so that A,-y will unambiguously mean 
the (/, j) block of the matrix A.) Using these block partitions, one can define the incidence algebra 
at the block matrix level in the natural way. We denote by Ia('P),Ib{'P) the block incidence 
algebras corresponding to the block partitions of A and B. 
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We will further assume that F is block diagonal and full column rank. ^ Often, matrices will 
have different (but compatible) dimensions and the block structure will be clear from the context. 
In these cases, we will abuse notation and will drop the subscript and simply write 

Definition 3: We say that a state-space system is f-poset-causal (or simply poset-causal) 
if A e Ia(P) and Be Ib(P). 

Example 2: We use this example to illustrate ideas and concepts throughout this paper. Con- 
sider the system 

x(t) = Ax(t) + Fw(t) + Bu(t) 
z(t) = Cx(t) + Du(t) 
y(t) = x(t), 

with matrices 

' 1 
110 
10 10 
1111 

q4x4 



c = 



-0.5 











-1 


-0.25 








-1 





-0.2 





-1 


-1 


-1 


-0.1 


y4x4 
q4x4 






F = 



B 



(2) 



D = 



t4x4 



(3) 



This system is poset-causal with the underlying poset described in Fig. 1(d). Note that in this 
system, each subsystem has a single input, a single output and a single state. The matrices A 
and B are in the incidence algebra of the poset. Furthermore, F = I. 

Recall that the standard notion of causality in systems theory is based crucially on an under- 
lying totally ordered index set (time). Systems (in LTI theory these are described by impulse 
responses) are said to be causal if the support of the impulse response is consistent with the 
ordering of the index set: an impulse at time zero is only allowed to propagate in the increasing 
direction with respect to the ordering. This notion of causality can be readily extended to 



^More generally we can assume that for the system under consideration (1), F e J(!P) and the diagonal blocks are full column 
rank. Operating under this assumption, one can perform an invertible coordinate transformation T £ 1(f) on the states so that 
T~^F is block diagonal. Since T can be chosen to be in the incidence algebra, T'^AT, T'^B £ J(f ). Hence, without loss of 
generality, we assume that F is block diagonal. 
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situations where the underlying index set is only partially ordered. Indeed this abstract setup 
has been studied by MuUans and Elliott [12], and an interesting algebraic theory of systems has 
been developed. 

Our notion of poset-causality is very much in the same spirit. We call such systems poset- 
causal due to the following analogous property among the sub-systems. If an input is applied 
to sub-system / via w, at some time t, the effect of the input is seen by the states xj for all 
sub-systems j e ii (at or after time 0- Thus ii may be seen as the cone of influence of input /. 
We refer to this causality-like property as poset-causality. This notion of causality enforces (in 
addition to causality with respect to time), causality with respect to the subsystems via a poset. 
For most of this paper we will deal with systems that are poset-causal (with respect to some 
arbitrary but fixed finite poset P). Before we turn to the problem of optimal control we state an 
important result regarding stabilizability of poset-causal systems by poset-causal controllers. 

Theorem 1: The poset-causal system (1) is stabilizable by a poset-causal controller K e 
if and only if the (A„, 5„) are stabilizable for all / G P. 

Proof: See Appendix. ■ 
In this paper, we make the following important assumption about the stabilizability of the sub- 
systems. By the preceding theorem, this assumption is necessary and sufficient to ensure that 
the systems under consideration have feasible controllers. 

Assumption 1: Given the poset-causal system of the form (1), we assume that the sub-systems 
(Aii,Bii) are stabilizable for all / G {1, . . . ,/?}. 

In the absence of this assumption, there is no poset-causal stabilizing controller, and hence 
the problem of finding an optimal one becomes vacuous. This assumption is thus necessary and 
sufficient for the problem to be well-posed. Moreover, in what follows, we will need the solution 
of certain standard Riccati equations. Assumption 1 ensures that all of these Riccati equations 
have well-defined stabilizing solutions. This stabilizing property of the Riccati solutions will be 
useful for proving internal stability of the closed loop system. 

The system (1) may be viewed as a map from the inputs w, u to outputs z,x via 

z = Pnw + Pnu 

X = P21W + P22U 
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where 

Cisl-A)-^F C{sl-A)-^B + D 
(sI-A)-^F isI-A)-^B 



A 


F 


B 


C 





D 


I 









A controller u = Kx induces a map T^^ from the disturbance input w to the exogenous output z 
via 

T,^ = Pn+Pi2K{I-P22Kr'P2,. 

Thus, after the controller is interconnected with the system, the closed-loop map is T,„. The 
objective function of interest is to minimize the 'Ki norm [30] of T^^ which we denote by \\T^w\\. 

C. Information Constraints on the Controller 

Given the system (1), we are interested in designing a controller K that meets certain spec- 
ifications. In traditional control problems, one requires K to be proper, causal and stabilizing. 
One can impose additional constraints on the controller, for example require it to belong to 
some subspace. Such seemingly mild requirements can actually make the problem significantly 
more challenging. This paper focuses on addressing the challenge posed by subspace constraints 
arising from particular decentrahzation structures. The decentralization constraint of interest in 
this paper is one where the controller mirrors the structure of the plant, and is therefore also in 
the block incidence algebra Ik{'P) (we will henceforth drop the subscripts and simply refer to 
the incidence algebra TCP)). This translates into the requirement that input Ui only has access 
to Xj for j e p thereby enforcing poset-causality constraints also on the controller. In this sense 
the controller has access to local states, and we thus refer to it as a decentralized state-feedback 
controller. 



Pn Pi2 
Pi\ Pii 
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D. Problem Statement 

Given the poset-causal system (4) with poset T = (P, <), \P\ = p, solve the optimization 
problem: 

minimize ||Pii + PnK(I - PiiKT'Piif 

K 

subject to K€ (5) 
K stabilizing. 

The main problem under consideration is to solve the above stated optimal control problem 
in the controller variable K. The feasible set is the set of all rational proper transfer function 
matrices that internally stabilize the system (1). In the absence of the decentralization constraints 
K G this is a standard, well-studied control problem that has an efficient finite-dimensional 
state-space solution [30]. The main objective of this paper is to construct such a solution for 
the poset-causal case. 

E. Notation 

Given a matrix Q, let Q(j) denote the 7* column of Q. We denote the i''^ component of the 
vector Q(j) to be Q(j)i. For a poset P with incidence algebra if M e then recall 

that M is sparse, i.e. has a zero pattern given by Mtj = if 7 ^ /. We denote the sparsity pattern 
of the column of the matrices in TCP) by I(Py. More precisely if v is a vector (of length 
equal to that of the matrix M) whose components are denoted by v, then 

nry ■.= {v\vi = o for jii]. 

In the above definition v is understood to be a vector composed of |P| blocks, with sparsity being 
enforced at the block level. 

Given the data (A, B, C, D), we will often need to consider sub-matrices or embed a sub-matrix 
into a full dimensional matrix by zero padding. Some notation for that purpose we will use is 
the following: 

1) Define Q^^ = [Qij]i€ij (so that it is the 7* column shortened to include only the nonzero 
entries). 

2) Also define A(ij) = [A(?)],ei; so that it is the sub-matrix of A containing exactly those 
columns corresponding to the set X7. 
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3) Define A(i j, i j) = [A^i^i^^j so that it is the (i j, i j) sub-matrix of A (containing exactly 
those rows and columns corresponding to the set X 7). 

4) Sometimes, given a block x matrix we will need to embed it into a block matrix 
indexed by the original poset (i.e. a p x p matrix) by padding it with zeroes. Given K (a 
block \ij\x matrix) we define: 



[^]/.m = 



Kim if /, m 6 ij 
otherwise. 



5) = [ ... / ... ]^ be the tall block matrix (indexed with the elements of the 
poset) with an identity in the i''^ block row. 

6) Let Sep. Define Es = [£■,], es. Note that given a block pxp matrix M, ME^j = M(ij) 
is a matrix containing the columns indexed by i j. 

7) Given matrices A,, / G P, we define the block diagonal matrix: 



diag(Ai) = 



Recall that every poset P has a linear extension (i.e. a total order on P that is consistent with 
the partial order <). For convenience, we fix such a linear extension of P, and all indexing of 
our matrices throughout the paper will be consistent with this linear extension (so that elements 
of the incidence algebra are lower triangular). 

Example 3: Let P be the poset shown in Fig. 1(d). We continue with Example 2 to illustrate 
notation. (Note that 12 = {2, 4}). As per the notation defined above. 



A^2 = 













-.25 




-0.25 







-0.25 




A(i2) = 






A(i2,i2) = 




-1 












-1 -0.1 




-1 


-0.1 





12 







1 2 



,then kill, 12) = 



10 2 



Also, if Kill, 12) = 



3 4 







3 4 
in. Solution Strategy 

In this section we first remind the reader of a standard reparametrization of the problem known 
as the Youla parametrization. Using this reparametrization, we illustrate the main technical idea 
of this paper using an example. 

A. Reparametrization 

Problem (5) as stated has a nonconvex objective function. Typically [14], [17], this is con- 
vexified by a bijective change of parameters given by R := K{I - P22K)~^ (though one typically 
needs to make a stabiUty or prestabilization assumption). When the sparsity constraints are 
poset-causal (or quadratically invariant, more generally), this change of parameters preserves the 
sparsity constraints, and R inherits the sparsity constraints of K. The resulting infinite-dimensional 
problem is convex in R. 

For poset-causal systems with state-feedback we will use a slightly different parametrization. 
We note that for poset-causal systems, the matrices A and B are both in the block incidence 
algebra. As a consequence of (4), P21 and P22 are also in the incidence algebra. This structure, 
which follows from the closure properties of an incidence algebra, will be extensively used. 
Since ^21,^22 G J^(^) the optimization problem (5) maybe be reparametrized as follows. Set 



where is the pseudoinverse of F, so that F^F = I (note also that the pseudoinverse is block 
diagonal and hence in J(!P)). As a consequence, given Q, K can be recovered using 



Q=K(I-P22K)-'P21. 



(6) 



Note that P21 is left invertible, and a left inverse is given by 



Pi, =F\sI-A), 



K = QPUi + P22QPIX' 



(7) 
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Since I,P2\,P\i,P2i all lie in the incidence algebra, K G J(^) if and only if Q & TCP). Using 
this reparametrization the optimization problem (5) can be relaxed to: 

minimize ||Pii + PnQf 

Q (8) 
subject to 2 G TCP). 

Remarks 1) We note that pI^, and hence (7) may potentially be improper. However, we will 
prove that for the optimal Q in (8), this expression is proper and corresponds to a rational 
controller K* G 

2) For the objective function to be bounded, the optimal Q would have to render Pn + PuQ 
stable. However, one also requires that the overall system is internally stable. We relax this 
requirement on Q and later show that K* is nevertheless internally stabilizing. Thus (8) is 
in fact a relaxation of (5). We show that the solution of the relaxation actually corresponds 
to a feasible controller. 

We would like to emphasize the very important role played by the availabiUty of full state- 
feedback. As a consequence of state-feedback, we have that P21 = (si - A)~^F. Thus P21 is left 
invertible (though the inverse is improper), and in the (block) incidence algebra. It is this very 
important feature of P21 that allows us to use this modified parametrization mentioned (6) in 
the preceding paragraph. This parametrization enables us to rewrite the problem in the form (8). 
This form will turn out to be crucial to our main separability result (Theorem 2), which enables 
us to separate the decentralized problem into a set of decoupled centralized problems. 

A main step in our solution strategy will be to reduce the optimal control problem to a set 
of standard centralized control problems, whose solutions may be obtained by solving standard 
Riccati equations. The key result about centralized "Hi optimal control is as follows. 

Lemma 2: Consider a system H given by 



H = 



Hn H12 



Ah 


Fh 


Bh 


Ch 





Dh 



along with the following optimal control problem: 



minimize \\Hn + HnQlf' 

Q (9) 

subject to Q stable. 
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Suppose the pair (Ajj,Bh) is stabilizable, C^Dh = 0, and D^Dh > 0. Then the following Riccati 
equation has a unique symmetric and positive definite solution X: 



AlX + XAh - XBh{DIDh)-'bIX + CICh = 0. 



Let L be obtained from this unique positive definite solution via: 

L = {dIDhY'bIx. 

Then the optimal solution to (9) is given by: 

Q 



(10) 



(11) 



Ah - BhL 


Fh 


-L 






(12) 



(We will often refer to the trio of equations (10), (11), (12) by (L, Q) = Ric(H).) 

Proof: The proof is based on standard techniques and can be argued via a completion-of- 
squares argument. In particular, it follows from the solution to the standard 'H2 optimal control 
problem [29, Theorem 14.7]. Using this theorem, the solution to the "Ka optimal control problem 
for the standard problem with the data 



G = 



Gn G12 
I 



Ah 


Fh Bh 


Ch 


Dh 





I 



gives the required formula. 



B. Separability of Optimal Control Problem 

We next illustrate the main solution strategy via a simple example. Consider the decentralized 
control problem (5) for the poset in Fig. 1(b). Using the reformulation (8) the optimal control 
problem (5) may be recast as: 

2 



mimmize 

Q 



Pn+P 



12 



Qn 

221 222 
Q3I (233 
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Note that Puiil) = Pi2,Pn(i2) = Pn(2) (second column of P12), and Puii^) = PnOl 



Similarly Q^^ = 



Qn Ql. Qlx 



, Q^^ = Q22, and Q^^ = Q33. Due to the column-wise 



separability of the Hi norm, the problem can be recast as: 



mimmize 
Q 



Pna) + Pi2a) 



Qn 
Q21 
Q31 



+ iiPii(2) + Pi2a2)!222ir 



+ ||Pn(3) + Pi2a3)(233ir 

Since the sets of variables appearing in each of the three quadratic terms is different, the problem 
now may be decoupled into three separate sub-problems, each of which is a standard centralized 
control problem. For instance, the solution to the second sub-problem can be obtained by noting 
the realizations of Pii(2) and Pi2(2) and then using (12). In this instance. 



= Ric ([ Pii(2) Pi2(i2) ]) = Ric 



A22 


F22 B22 


C(2) 


D(2) 



In a similar way, the entire matrix Q* can be obtained, and by design Q* G (and is 

stabilizing). To obtain the optimal K*, one can use (7). In fact, it is possible to give an explicit 
state-space formula for K*, this is the main content of Theorem 3 in the next section. 

IV. Main Results 

In this section, we present the main results of the paper. The proofs are available in Section 



VI. 



A. Problem Decomposition and Computational Procedure 

Theorem 2 (Decomposition Theorem): Let ^ be a poset and be its incidence algebra. 
Consider a poset-causal system given by (4). The problem (8) is equivalent to the following set 
of \P\ independent decoupled problems: 

minimize \\PnU) + PuUm^'f "^j^P (13) 

Qlj 

Theorem 2 is essentially the first step towards a state-space solution. The advantage of this 
equivalent reformulation of the problem is that we now have p = \P\ sub-problems, each over 
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a different set of variables (thus the problem is decomposed). Moreover, each sub-problem 
corresponds to a particular standard centralized control problem, and thus the optimal Q in 
(5) can be computed by simply solving each of these sub-problems. 

The subproblems described in (13) have the following interpretation. Once a controller K, or 
equivalently Q is chosen a map T^^ from the exogenous inputs w to the outputs z is induced. Let 
us denote by T^(l) to be the map from the first input zi to all the outputs w (this corresponds 
to the first column of T^). Similarly, the map from Zi to w for / G P is given by T^{i). These 
subproblems correspond to the computation of the optimal maps T*yv{i) for all / e P from the 
z"' input Zi to the output w. The decomposability of the 'Hj norm implies that these maps may 
be computed separately, and the performance of the overall system is simply the aggregation of 
these individual maps. 

Our next theorem provides an efficient computational technique to obtain the required state- 
space solution. To obtain the solution, one needs to solve Riccati equations corresponding to the 
sub-problems we saw in Theorem 2. We combine these solutions to form certain simple block 
matrices, and after simple LFT transformations, one obtains the optimal controller K*. 

Before we state the theorem, we introduce some relevant notation. 
Definition 4: We define the operator Ricdy) for 7 e P by: 

Ric(i7) := Ric 

We define K{ij,ij) via {K{ij,ij), Q{j)) = Ricdj) for j e P. We introduce two matrices related 
to the above solution, namely: 

A = diag(A(i7, ij) - B{ij, ijWaj, ij)) 
K = diag(i^(Ui7)). 



Mij, ij) 

caj) 



Fjj BilMj) 



Diij) 



(14) 
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We will see later on that A is the closed-loop state transition matrix under a particular indexing 
of the states. We introduce three matrices related to structure of the poset, namely: 



Hi = 



El ... 
El ... 



U2 = diag I 




E2 



El 

Em 



(15) 



])• 



R = 



Hi 



Hp 



(In Hi, the f'^ diagonal block Ei has \i j\ number of block rows. To be precise, it is a (Y^keu nk)'Xnj 
matrix with the first nj x tij block as the identity and the rest zeroes.) These matrices also have a 
natural interpretation. In writing the overall states of the closed loop in vector form, we first write 
the states of subsystem 1 (i.e. xi) of the plant, then the states of the controller for subsystem 1 
(i.e. ^(1)), then subsystem 2 plant states and controller states, and so on. In this indexing. Hi is 
a projection operator that projects onto the coordinates of all the state variables xi,...,X2. II2 
is simply the matrix that projects onto the orthogonal complement, i.e. the controller variables 
^(1), . . . , q(p). The optimal controller and other related objects can be expressed in terms of the 
following matrices: 





= njAn 




= n^An 


Co 




Cq 


= -RK. 



(16) 



We illustrate this notation further by means of a numerical example in Appendix A. 

Theorem 3 ( Computation of Optimal Controller): Consider the poset-causal system of the form 
(4), with (An, Bii) stabilizable for all / G P. Consider the following Riccati equations: 



(Kiij, ij), QU)) = Ric (if, V7 e P. 
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Then the optimal solution to the problem (5) is given by the controller: 



K* = 



— B(^Cq 



(17) 



Ce(n2 - Hi Co) 

Moreover, the controller K* e TCP) and is internally stabilizing. 

Recall that denotes the degree of the f'' sub-system in (1). Let Hmax = max/n, be the largest 
degree of the sub-systems. Let n(iii) = Z^eU; ^J- '^'P ~ ^j^p J\ (i^ote that this is a purely 
combinatorial quantity, dependent only on the poset). As we mentioned in the introduction, one 
of the advantages of state-space techniques is that they provide graceful degree bounds for the 
optimal controller. As a consequence of Theorem 3 we have the following: 

Corollary 2 (Degree Bounds): The degree cIk* of the overall optimal controller is bounded 
above by 

dK' <Yj<iij)- 

In particular, dx* < crpUaiia- Moreover, the degree of the controller implemented by subsystem j 
is bounded above by n(iij). 



B. Structure of the Optimal Controller 

Having established the computational aspects, we now turn to some structural aspects of 
the optimal controller. We first introduce a pair of very important objects (O, F), called the 
propagation filter and the differential filter, respectively. Define the block pxp transfer function 
matrices (<l>,r) via: 



Ao 


5(D 


T = 


A(j) — B(pC(^ 


Bq, 










Co 


I 




—Co 


I 



Note that both O and F are invertible (since their "D" matrices are equal to /), and in fact, 
they are inverses of each other, i.e., FO = OF = I. We sometimes denote the entries O,^ = 0,^y 

and similarly F,y = F;^y to emphasize a certain interpretation of these quantities. We note that 
<I),<_; = F;^, = / (this can be seen from the fact that the corresponding entries in the "C" matrices 
of the transfer functions is zero). We show (Lemma 5 in Section VI), that O, F G J(P). Moreover, 
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the fact that O ^ = F in conjunction with Corollary 1 gives the following expression: 

We will show that 0/<_jt, in fact, corresponds to a specific filter that propagates local signals 
upstream. For example, in Fig. 1(a), if xi is the state at subsystem 1, O21X1 is the prediction 
of state X2 at subsystem 1. On the other hand, F has an interesting dual interpretation. As one 
proceeds "upstream" through the poset, more information is available, and consequently the 
prediction of the global state becomes more accurate. The transfer function F plays the role of 
computing the differential improvement in the prediction of the global state. For this reason, 
we call it the differential filter. Interestingly, it is intimately related to the notion of Mobius 
inversion on a poset, a generalization of differentiation to posets. We briefly discuss these ideas 
in the ensuing discussion. Before stating the next theorem, we introduce the transfer function 
matrix K(s,, which is defined column-wise via: 

K^{j) = k{ij,iimi). (20) 

Theorem 4 (Structure of Optimal Controller): The optimal controller (17) is of the form: 

m(0 = -K^Yx{t) 

= -YjK{ihiimiWx)j{t). 

Remark Let us denote the vector e{]) = ^{j){Yx)j. We will interpret e{j) as the differential 
improvement in the prediction of the global state x at subsystem j. Denoting k{X j, [ j) by Kj, 
note that the control law takes the form u{t) = J^j^p Kje(j). This structural form suggests that 
the controller uses the differential improvement of the global state at the different subsystems 
as the atoms of local control laws, and that the overall control law is an aggregation of these 
local control laws. 

C. Interpretation of O and F 

In this section we explain the role of the propagation filter O and the differential filter F. Due 
to the information constraints in the problem, at subsystem j only states in Tj are available, 
states of other subsystems are unavailable. A reasonable architecture for the controller would 
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involve predicting the unknown states at subsystem j from the available information. This is 
illustrated by the following example. 

Example 4: Consider the system shown in Fig. 2 with dynamics 











. ^3 . 





All 
A22 

A31 A32 A33 



•^3 



+ 



Bn Ml 

522 U2 

B31 B32 B33 M3 



Xl 



. 2:3(1) 

Xl 
. ^3 . 

Fig. 2. Local state information at the different subsystems. The quantities Xi(\) and Xi{2) are partial state predictions of xj,. 

Note that subsystem 1 has no information about the state of subsystem 2. Moreover, the state 
X\ or input Ui do not affect the dynamics of 2 (their respective dynamics are uncoupled). Hence 
the only sensible prediction of X2 at subsystem 1 (which we denote by X2(l)) is X2(l) = 0. 
Subsystem 1 also does not have access to the state X3. However, it can predict X3 based on the 
influence that the state xi has on X3. (Note that both the states xi, X2 and inputs mi, M2 afl'ect 
X3 and M3.) Let us denote X3(l) to be the prediction of state X3 at subsystem 1. Since X2 and 
U2 are unknown, the state X3(l) is a partial prediction of X3 (i.e. X3(l) is the prediction of the 
component of X3 that is affected by subsystem 1). Similarly, subsystem 2 maintains a prediction 
of X3 denoted by X3(2), which is also a partial prediction of X3. Each subsystem thus maintains 
(possibly partial) predictions of unknown downstream states, as shown in Fig. 2. 

In this paper we will not discuss how the state predictions are computed, a detailed discussion 
of the same is available in [16, Chapter 5], [20]. However, we mention that F has an interesting 
related role. At subsystem j the true state xj becomes available for the first time (with respect 
to the subposet Tj)- The quantity qj{j) '■= (rx)y measures the differential improvement in the 
knowledge of state xj, i.e. the difference between the true state xj and its best prediction from 
upstream information. Similarly, it is possible do define qj(i), the differential improvement in 

21 




the prediction of state j at subsystem /. We let 



q(i) = [qj(l)^puh 

so that q(i) corresponds to the differential improvement in state predictions at the i''^ subsystem. 
This q(i) is a vector of length and its components qjii) are differential improvements of Xj 
for j G iii at subsystem /. 
We next examine the role of O. Consider a system of the form: 

r(0 = Hr(t), 

where r(t) G R". Given ri(t) it is possible to compute r2(t), rn(t) by propagation by noting 
that (si - H)r{z) = 0. Rewriting these equations, we obtain that 



sI-Hn -Hn 
—H21 si — H22 



= 



to obtain 



r2 



(s) = (sI-H22r'H2ir,is), 



(21) 



where H22 = £'(^ „|i/£j2,...,„) and H21 = Ej^^ ^^^HEi. The map = (si - //22)"^^2i from ^ to 
r2, . . . ,r„ is simply a propagation of the "upstream" states based on r\. 

It is possible to show [16, Chapter 5], [20] that the differential improvements in the local state 
predictions q{i) obey a decoupled relationship as a consequence of a separation principle. (Thus 
r in (21) corresponds to the differential improvement q{i) for some subsystem /)• As we already 
mentioned (Fx), is the differential improvement in x, at subsystem /. Since O plays the role of 
propagating decoupled local signals, it follows that 0^,(rx), is the differential improvement in 
the prediction of the state Xj for j e Hi at subsystem i. More precisely. 



qjii) = ^ji(rx)i. 



(22) 
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and this can be written compactly as ^(0 = <I)(/)(rx)„ where the column vector q(i) is the 
differential improvement in the global state at subsystem /. As an aside, we mention that the 
states of the optimal controller correspond precisely to these differential improvements qj(i)- 

D. Structure of the Optimal Controller 

Using Theorem 4, the optimal control law can be expressed as: 

u = YjKai,ii)m(rx)i. (23) 

As explained above, 0(i)(rjc), is a vector containing the differential improvement in the prediction 
of the global state at subsystem Each term K(ii, ii)0(i)(Tx)i may be viewed as a local control 
law acting on the local differential improvement in the predicted state. The overall control law 
has the elegant interpretation of being an aggregation of these local control laws. 

Example 5: Let us consider the poset from Fig. 1(d), and examine the structure of the 
controller. (For simplicity, we let Kj = k{ij,ij), the gains obtained by solving the Riccati 
equations). The control law may be decomposed into local controllers as: 



/ 





















/ 
















(rx)2 + ^3 




(rx)3 + K, 




O31 









/ 







O41 




€»42 




O43 




/ 



(rx)4 



X\ 

















^2(1) 




■^2-^2(1) 
















^3(1) 









X3 - ^3(1) 







?4(1) . 




?4(2) 




?4(3) 




{xa - ^4(1)) - ?4(2) - ^4(3) 



Each term in the above expression has the natural interpretation of being a local control signal 
corresponding to differential improvement in predicted states, and the final controller can be 
viewed as an aggregation of these. 

Note that zeros in the above expression imply no improvement on the local state. For example, 
at subsystem 2 there is no improvement in the predicted value of JC3 because the state X2 
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does not affect subsystem 3 due to the poset-causal structure. There is no improvement in 
the predicted value of state at subsystem 4 either, because the best available prediction of X3 

from downstream information TT4 is X3 itself. While this interpretation has been stated informally 
here, it has been made precise in [16, Chapter 5], [20]. 



V. Discussion and Examples 



A. The Nested Case 

Consider the poset on two elements f = ({1,2} , <) with the only order relation being I < 2 
(Fig. 1(a)). This is the poset corresponding to the communication structure in the "Two-Player 
Problem" considered in [22]. We show that their results are a specialization of our general results 
in Section IV restricted to this particular poset. 

We begin by noting that from the problem of designing a nested controller (again we assume 
F = / for simplicity) can be recast as: 



minimize 
Q 



Pn + Pn 


Gil 


1 






. Q21 


Q22 \ 



By Theorem 2 this problem can be recast as: 



mimmize 

Q 



Qn 
Q21 



+ \\Pn+Pl2Q22\ 



We wish to compare this to the results obtained in [22]. It is possible to obtain precisely 
this same decomposition in the finite time horizon where the Hi norm can be replaced by 
the Frobenius norm and separability can be used to decompose the problem. For each of the 
sub-problems, the corresponding optimality conditions may be written (since they correspond to 
simple constrained-least squares problems). These optimality conditions correspond exactly to 
the decomposition of optimality conditions they obtain (the crucial Lemma 3 in their paper). We 
point out that the decomposition is a simple consequence of the separability of the Frobenius 
norm. 

Let us now examine the structure of the optimal controller via Theorem 4. Note that il = 
{1,2} and 12 = {2}. Based on Theorem 3, we are required to solve (K,Q(\)) = Ric(i,l), and 
(/, 2(2)) = Ric(X2). Noting that in this example Fa^-i = -02«-i, a straightforward application of 
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Theorem 4 yields the following: 

Mi(0 = -(Kn + Kn^2^i)xi(t) 

U2(t) = -(K21 + K22^2^l)Xiit) - J{X2{t) - O2iXi(0), 

which is precisely the structure of the optimal controller given in [22], [23]. It is possible to 
show (as Swigart et. al indeed do in [22]) that 02<-i is an predictor of X2 based on xi. Thus the 
controller for ui predicts the state of X2 from x\, uses the estimate as a surrogate for the actual 
state, and uses the gain ^21 in the feedback loop. The controller for U2 (perhaps somewhat 
surprisingly) also estimates the state xj based on x\ using X2 = <l>2i-^i (this can be viewed 
as a "simulation" of the controller for ui). The prediction error for state 2 is then given by 
^2 := X2 - X2 = JC2 - <I>2i^i. The control law for U2 may be rewritten as 

U2 = -(K21X1 + K22X2 + J €2). 

Thus this controller uses predictions of X2 based on x\ along with prediction errors in the feedback 
loop. We will see in a later example, that this prediction of states higher up in the poset is 
prevalent in such poset-causal systems, which results in somewhat larger order controllers. 

Analogous to the results in [22], it is possible to derive the results in this paper for the finite 
time horizon case (this is a special case corresponding to FIR plants in our setup). We do not 
devote attention to the finite time horizon case in this paper, but just mention that similar results 
follow in a straightforward manner. 

B. Discussion Regarding Computational Complexity 

Note that the main computational step in the procedure presented in Theorem 3 is the solution 
of the p sub-problems. The sub-problem requires the solution of a Riccati equation of size 
at most Ii7l«max = 0{p) (when the degree Wmax is fixed). Assuming the complexity of solving a 
Riccati equation using linear algebraic techniques is 0{p'^) [6] the complexity of solving p of 
them is at most O(p^). We wish to compare this with the only other known state-space technique 
that works on all poset-causal systems, namely the results of Rotkowitz and Lall [14]. In this 
paper, they transform the problem to a standard centralized problem using Kronecker products. 
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In the final computational step, one would be required to solve a single large Riccati equation 
of size O(p^), resulting in a computational complexity of 0{p^). 

C. Discussion Regarding Degree Bounds 

It is insightful to study the asymptotics of the degree bounds in the setting where the sub- 
systems have fixed degree and the number of sub-systems p grows. As an immediate consequence 
of the corollary, the degree of the optimal controller (assuming that the degree of the sub-systems 
"max is fixed) is at most O(p^) (since n(ij) < p).ln fact, the asymptotic behaviour of the degree 
can be sub-quadratic. Consider a poset ({!,...,/?},<) with the only order relations being 1 < i 
for all /. Here |X1| = p, and |X/| = 1 for all / 1. Hence, liji ~ P ^ P-> ^iid thus d* < OTmax- In 
this sense, the degree of the optimal controller is governed by the poset parameter ap. 

VI. Proofs of the Main Results 

Proof of Theorem 1: Note that one direction is trivial. Indeed if the (A,, , Bu) are stabilizable, 
one can pick a diagonal controller with diagonal elements Ka such that A„ + BaKa is stable for 
all i e P. This constitutes a stabilizing controller. 
For the other direction let 





Bk 


Ck 


Dk 



be a poset-causal controller for the system. We will first show that without loss of generality, 
we can assume that Ak, Bk, Ck, Dk are block lower triangular (so that K has a realization where 
all matrices are block lower triangular). 

First, note that since K e TCP), Dk e ICP)- Recall, that we assumed throughout that the 
indices of the matrices in the incidence algebra are labeled so that they are consistent with a 
linear extension of the poset, so that Dk is lower triangular. Note that the controller ^ is a block 
px p transfer function matrix which has a realization of the form: 



K = 



Ak 


Bk(1) . 


■ Bk(p) 




Dk(1,1) . 


. Dk(1,p) 


Ck(p) 


Dk(p,1) . 


■ Dk(p,p) 
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Since the controller K G J(^), we have that Kpj = for all j ^ p (recall that p is the cardinality 
of the poset). This vector of transfer functions (given by the last column of K with the (p,p) 
entry deleted) is given by the realization: 



Ck(1) 
Ck(p-I) 



(sI-AKrBK(p) + 



DKihp) 
D Kip -hp) 



= 0. 



Since this transfer function is zero, in addition to DK{j,p) = for all j = I, ... ,p - I, 
it must also be the case that the controllable subspace of (Ak,Bk(p)) is contained within the 



unobservable subspace of 



CK(p-iy 



,Ak\. By the Kalman decomposition 



theorem [5, pp. 247], there is a realization of this system of the form: 



A 


B 


C 


D 



where (A,B,C,D) are of the form: 



B = 

C = 
D = 



All 

A21 A22 
A21 A32 A33 




53 

Ci 



(24) 



As an aside, we remind the reader that this decomposition has a natural interpretation. For 
example, the subsystem 





All 














\ 












Ci 






A21 


A22 












corresponds to the observable subspace, where the system is uncontrollable, etc. (The usual 
Kalman decomposition as stated in standard control texts is a block 4x4 decomposition of the 
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state-transition matriK. Here we have a smaller block 3x3 decomposition because of the collapse 
of the subspace where the system is required to be both controllable and observable). 

Thus this decomposition allows us to infer the specific block structure (24) on the matrices 
C,D). As a result of this block structure, there is a realization of the overall controller 
(Ak, Bk, Ck, Dk), where all the matrices have the block structure 



M 



P-i.i 



M 



Mr, 



M„ 









Mr, 



• • • "^P,P 

One can now repeat this argument for the upper (/?-l)x(/?-l) sub-matrix of K. By repeating 
this argument for first p - I, p - 2, ... ,1 we obtain a realization of K where all four matrices 
are block lower triangular. 

Note that given the controller K (henceforth assumed to have a lower triangular realization), 
the closed loop matrix Ad is given by 



Ad = 



A + BDk BCk 
Bk Ak 



By assumption the (open loop) system is poset-causal, hence A and B are block lower triangular. 
As a result, each of the blocks A+BDk, BCk, Bk, Ak are block lower triangular. A straightforward 
permutation of the rows and columns enables us to put Ad into block lower triangular form where 
the diagonal blocks of the matrix are given by 



Ajj + BjjDKjj BjjCkjj 
Bk Ak 



(25) 



Note that the eigenvalues of this lower triangular matrix (and thus of Ad, since permutations of 
rows and columns are spectrum-preserving) are given by the eigenvalues of the diagonal blocks. 
The matrix Ad is stable if and only if all its eigenvalues are within the unit disk in the complex 
plane, i.e. the above blocks are stable for each j e P. Note that (25) is obtained as the closed-loop 
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matrix precisely by the interconnection of 







I 






with the controller 





Bk 

'^11 




Dk 



Hence, (25) (and thus the overall closed loop) is stable if and only if {Ajj, Bjj) are stabilizable 
for all j G P, and (Akjj, Bkjj, Ckjj, Dkjj) are chosen to stabilize the pair. ■ 
Proof of Theorem 2: If G = [Gi, . . . , G^] is a matrix with G, as its columns, then 



l|G||^ = 2'l<^'ll 



2 

F' 



;=1 



where || • \\p denotes the Frobenius norm. This separability property of the Frobenius norm 
immediately implies the following separability property for the ^2 norm: \f H = {Hi, . . .Hk\ is 
a transfer function matrix with //, as its columns, then 

k ^ k 



JjR ^ JjR ^ 



(In the above jR denotes the imaginary axis in the complex plane). The separability property of 
the 'H2 norm can be used to simplify (9). Recall that /'ii(j), Q{j) denote the columns of Pu 
and Q respectively. Using the separability we can rewrite (9) as 



minimize ZyeP ll^ii(7) + ^'122(7)11' 

Q 

subject to Q(j) e I(ry 
The formulation in (26) can be further simplified by noting that for Qj G I(Py, 

PuQU) = Pniim^'. 



(26) 



(27) 



The advantage of the representation (27) is that, in the right hand side the variable Q^^ is 
unconstrained. Using this we may reformulate (26) as: 



minimize ^^.p ll^ii(7) + PniiM^'f 

Qlj 



(28) 
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Since the variables in the Q^j are distinct for different j, this problem can be separated into p 
sub-problems as follows: 



minimize WPnij) + ^i2(i7)!2-^^ll' 

QlJ 

for all j e P. 



(29) 



Note that each sub-problem is a standard "Hi optimal centralized control problem, and can 
be solved using canonical procedures. Once the optimal Q is obtained by solving these sub- 
problems, the optimal controller may be synthesized using (7). The following lemma describes 
the solutions to the individual sub-problems (13) in Theorem 2. 

Lemma 3: Let (A,B,C,D) be as given in (1) with A,B in the block incidence algebra 
Let 

(^(Ui7XQ(7)) = Ric(i7). (30) 



Then the optimal solution of each sub-problem (13) is given by 







(31) 



(We remind the reader that in the above Ei is the block IX7I x 1 matrix which picks out the 
first column corresponding of the block IX7I x IX7I matrix before it.) 



Proof: The proof follows directly from Lemma 2 by choosing 
H= Pn(j) Pniij) 



Mij, ij) 




BiUij) 


C(i7) 





Diij) 



Lemma 4: The optimal solution to (8) is given by 

Q* = 



A 




Cq 






(32) 



Proof: We note that Lemma 3 gives an expression for the individual columns of Q*. Using 
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Lemma 3 and the LFT formula for column concatenation: 










5i 





Gi G2 












B2 






C2 




D2 



we obtain the required expression. ■ 
Lemma 5: The transfer function matrices O, Y and A^o as given in (18), (20), are in the 
incidence algebra J(P). 

Proof: Let us define block px \ transfer functions as follows: 



0(;) 











I 




Acd(7) 






-k{ij,ij)Ej 



(33) 



Note that K^{j) = K(ij, ij)^(j). Also, note that if / is such that j i i then the i'^ entry of 0(7) 
is zero since the corresponding row of E^^j is zero. By similar reasoning, K^{j) also has this 
property. Thus, when we construct the matrices 



O = 



0(1) 
^.1.(1) 



0(p) 
KM 



by column concatenation, we see that both O e J(!P) and e Ik(P)- Since F = O \ we have 

Lemma 6: The matrix A is stable. 

Proof: Recall that A = diag(A(i7U7) - BUj, UWUj, iDl Since A(i7U7) and BUfij) 
are lower triangular with A^k, B^k, k e ij along the diagonals respectively, we see that the 
pair (A(X7, X7),5(J,7, ^7)) is stabilizable by Assumption 1 (simply picking a diagonal K which 
stabilizes the diagonal terms would suffice to stabilize (A(lj, Ij), B{lj, Ij))). Hence, there exists 
a stabilizing solution to Ric(i7) and the corresponding controller K(ij,ij) is stabilizing. Thus 
Mij, ij) - B(ij, ij)K(ij, ij)) is stable, and thus so is A. ■ 

Given transfer functions M and K, their feedback interconnection is usually described through 
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a linear fractional transformation of the form f(M, K) = Mn+ Mi2K(I - M22K)~^ M21. State space 
formulae for this interconnection are standard [30, pp. 179] and will be useful for evaluating 
several quantities in what follows. 
Lemma 7: Given transfer function matrices M and K with realizations 





A 




B2 


M = 


Ci 


Dn 


Dn 






D21 






Ak 


Bk 


Ck 


Dk 



K 



the Linear Fractional Transformation (LFT) /(M, K) = Mn + M^Kil - M22K)~^ M21 is given by 
the state-space formula 



(34) 



Proof: The proof is standard, see for example [30, pp. 179] and the references therein. 
Proof of Theorem 3: Consider again the optimal control problem (5): 





A + B2DKC2 


B2CK 


Bi + B2DKD21 


f(M,K) = 


BkC2 


Ak 


BkD21 




Ci + DnDKC2 


DuCk 


Dn+DnDKD2i 



minimize ||Pii + Px2K{,I - P2iKrPii\ 

K 



subject to 



K stabilizing. 



(35) 



Let Vj be the optimal value of (35). Consider, on the other hand the optimization problem: 

minimize ||Pii + 



subject to 2 e 



(36) 



Let V2 be the optimal value of (36). Recall that the optimal solution Q* of (36) was obtained 
in Lemma 4 as (32). We note that if K* is an optimal solution to (35) then the corresponding 
Q := K*{I - P2iK*)-^P2i is feasible for (36). Hence v* < v^. We will show that the controller 
in (17) is optimal by showing that Q = Q* (so that v\ = vp. We will also show that K* G I{T) 
and is internally stabilizing. Since it achieves the lower bound and is internally stabilizing, it 
must be optimal. 
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Given K*, one can evaluate Q := K*(I - P22K*) P21. To do so we use K* as per (17) and 



M = 



/ 

P21 P22 



A 


F 


B 








I 


I 









and use the interconnection formula (34) to obtain: 



Q 



A + BCqUi BCq(U2-UiC<s,) 
B(j, A^ — B(^C^ 



F 








CqHi Cq(U2 - Hi Co) 

Recall that Q* given by (32) is the optimal solution to (8) (which constitutes a lower bound 
to the problem we are trying to solve). We are trying to show that it is achievable by explicitly 
producing K* such that Q := K*{1 - P22K*)~^P 21 and Q - Q*, thereby proving the optimality of 

While Q* in (32) and Q obtained above appear different at first glance, their state-space 
realizations are actually equivalent modulo a coordinate transformation. Recall that 112 is a 
matrix (composed of standard unit vectors) that spans the orthogonal complement of the column 



span of Hi. As a result the matrix 



Hi n2 



is a permutation matrix. Define the matrices 



/ -Co, 
/ 



A- 



/ Co 
/ 



Hi Ii2 

Note that A is a square, invertible matrix. Changing state coordinates on Q* using A via: 



A A-^AA 
HiF ^ A-^HiF 
Cq ^ CqK 

along with the relations FII2III + U[ = R, RK- BCq = AR, and ARUi = A, we see that the 
transformed realization of Q* is equal to the realization of Q, and hence Q* = Q. 

Using (4) for the open loop, (17) for the controller and the LFT formula (34) to compute the 
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closed loop map, one obtains that the closed-loop state transition matrix is given by 



A + BCqUi BCq(U2-UiC^) 

By Lemma 6, the closed loop is internally stable. 
By the column concatenation formula and (33) we have 



= A. 







Cd) 


I 







Using the state-space coprime factorization formula [28, pp. 52] it is straightforward to verify 
that for the expression for K* in (17), K* = K^O~^. Since by Lemma 5 both <D G and 
Ko e J(!P), we have K* G ■ 
Proof of Theorem 4: By the column concatenation formula and (33) we have 





B^ 


C(i) 


I 







Using the state-space coprime factorization formula [28, pp. 52] to evaluate K^^~^ we see that 
K* = ^$0~^ This directly gives the first expression in the statement of the theorem. The second 
expression is a simple manipulation of the first expression. ■ 

Vn. Conclusions 

In this paper we provided a state-space solution to the problem of computing an 'K2-optimal 
decentralized controller for a poset-causal system. We introduced a new decomposition technique 
that enables one to separate the decentralized problem into a set of centralized problems. We 
gave explicit state-space formulae for the optimal controller and provided degree bounds on the 
controller. We illustrated our technique with a numerical example. Our approach also enabled us 
to provide insight into the structure of the optimal controller. We introduced a pair of transfer 
functions (0,r) and showed that they were intimately related to the prediction of the state. In 
future work, it would be interesting to attempt to apply this decomposition technique to a wider 
class of decentralization structures. 
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Appendix 

A. Numerical Example 

In this section, we consider a numerical example for the poset shown in Fig. 1(d). The system 
has one state and one input per subsystem, and we synthesize the optimal controller. The data 
for the is the same as in Example 2 with the matrices A, B, C, D as given in (2). For this problem 
the relevant matrices that are used in constructing the controller are: 



Hi = 



El E5 E-j Eg 



R = 



1 





























1 








1 




















1 











1 

















1 





1 





1 


1 



n, = 



Eo E^ Ed Ef, 



(Recall that Et is the 9x 1 unit vector.) Note that il = {1, 2, 3, 4} , [2 = {2,4} , ^3 = {3, 4} , i4 = 
{4}. Accordingly, the Riccati subproblems that we need to solve are given by Ric(X_/) for j = 
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1,2,3,4. Upon solving these, we obtain 



^ai,ii) = 



/5:a3,i3) = 



.7175 .3515 .3616 -.0751 

-.9671 .9575 .1827 .1033 

-1.0306 .2045 1.0312 .0814 

.6337 -.7902 -.8121 .8935 

1.0960 .0792 
-.8226 .9019 



i^a2,i2) = 



1.0237 .0990 
-.8011 .9001 



/s:a4,i4) = .9050. 



From these, it is possible to construct A(l j, i j) - B(i j, i j)K(l j, I j) for y G { 1, 2, 3, 4}, and from 
that construct A = diag(A(i;, i;) - B{ij, ij)K(ij, ij)) given by 



-1.2175 


-.3515 


-.3616 


.0751 

















-.7505 


-1.5589 


-.5443 


-.0282 

















-.6869 


-.5560 


-1.5927 


-.0064 

















-.3535 


-1.7232 


-1.7634 


-1.1032 





























-1.2737 


-.0990 























-1.2226 


-1.0991 





























-1.2969 


-.0792 























-1.2733 


-1.0811 





























-1.0050 



Using (16) one readily obtains A$ and 5$ to be 





-1.5589 


-.5443 


-.0282 










-.7505 













-.5560 


-1.5927 


-.0064 










-.6869 











= 


-1.7232 


-1.7634 


-1.1032 








5<D — 


-.3535 






















-1.0991 










-1.2226 






















-1.0811 










-1.2733 
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Note that given K(i2, ^2) ( a 2 x 2 matrix) one needs to construct ^(^2, 12) (a 4 x 4 matrix) by 
zero padding. For instance, we have 





1.0237 .0990 



-.8011 .9001 



From these, one constructs Cq using (16) to be 



'2 



.7175 
-.9671 
-1.0306 



.3515 .3616 
.9575 .1827 
.2045 1.0312 



.0751 

1033 1.0237 .0990 

0814 1.0960 .0792 

8935 -.8011 ..9001 -.8226 .9019 .9050 



.6337 -.7902 -.8121 

We use these quantities to obtain the controller K* using formula (17). The controller 

K = 



Ak 


Bk 


Ck 


Dk 



has the following realization: 



Ak = 



-1.5589 -.5443 

-.5560 -1.5927 

-1.7232 -1.7634 

1.2226 







1.2733 



-.0292 
-.0064 
-1.1032 











-1.0991 









-1.0811 





-.7505 













-.6869 











Bk = 


-.3535 
















-1.2226 
















-1.2733 
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-.1115 -.3616 .0751 

.0662 -.1827 -.1033 -.0991 

-2.045 .0648 -.0814 -.0792 

-.0109 -.0106 .0115 .0489 .0031 



Dk = 



-.7175 

.9671 -1.0237 

1.0306 

-.6337 0.8011 






•1.0960 










0.8226 -0.9050 



Note that the optimal controller is of degree 5. This matches the bound obtained in Corollary 2 
exactly. Note also that the matrix Dk is in the incidence algebra (and so is the controller K 

itself, as can be verified from the transfer function). Finally, this controller can be verified to be 
stabilizing. Let /Zopen, ^centralized, ^decentralized bc the opcu loop. Optimal Centralized closed loop and 
optimal decentralized closed loop 'H2 norms. We obtain the following values: 



Vn = 31.6319 



^centralized — 2.3197 
^decentralized — 2.8280. 
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